Pre-reordering Model of Chinese Special Sentences for Patent Machine Translation
نویسندگان
چکیده
Chinese prepositions play an important role in sentence reordering, especially in patent texts. In this paper, a rule-based model is proposed to deal with the long distance reordering of sentences with special prepositions. We firstly identify the prepositions and their syntax levels. After that, sentences are parsed and transformed to be much closer to English word order with reordering rules. After integrating our method into a patent MT system, the reordering and translation results of source language are effectively improved.
منابع مشابه
Sentence Type Based Reordering Model for Statistical Machine Translation
Many reordering approaches have been proposed for the statistical machine translation (SMT) system. However, the information about the type of source sentence is ignored in the previous works. In this paper, we propose a group of novel reordering models based on the source sentence type for Chinese-toEnglish translation. In our approach, an SVM-based classifier is employed to classify the given...
متن کاملUsing unlabeled dependency parsing for pre-reordering for Chinese-to-Japanese statistical machine translation
Chinese and Japanese have a different sentence structure. Reordering methods are effective, but need reliable parsers to extract the syntactic structure of the source sentences. However, Chinese has a loose word order, and Chinese parsers that extract the phrase structure do not perform well. We propose a framework where only POS tags and unlabeled dependency parse trees are necessary, and ling...
متن کاملLearning Bilingual Linguistic Reordering Model for Statistical Machine Translation
In this paper, we propose a method for learning reordering model for BTG-based statistical machine translation (SMT). The model focuses on linguistic features from bilingual phrases. Our method involves extracting reordering examples as well as features such as part-of-speech and word class from aligned parallel sentences. The features are classified with special considerations of phrase length...
متن کاملWord Order Does NOT Differ Significantly Between Chinese and Japanese
We propose a pre-reordering approach for Japanese-to-Chinese statistical machine translation (SMT). The approach uses dependency structure and manually designed reordering rules to arrange morphemes of Japanese sentences into Chinese-like word order, before a baseline phrase-based (PB) SMT system applied. Experimental results on the ASPEC-JC data show that the improvement of the proposed pre-re...
متن کاملLocal Phrase Reordering Model for Chinese-English Patent Machine Translation
We focused on when and how to reorder the Chinese NPs with two, three translation units for reordering in Chinese-English Machine Translation. By analyzing the features of translation units of the Chinese NPs, we built some formalized rules to recognize the boundaries of translation units using the boundary words to recognize what to reorder. By comparing the orders of Chinese and English NPs, ...
متن کامل